E-msNFIS: An Efficient Method for Mining Negative Frequent Itemsets based on Multiple Minimum Supports
نویسندگان
چکیده
Negative Frequent Item Sets (NFIS) like (a1a2¬a3a4) have played important roles in real applications because many valued negative association rules can be found from them. Very few methods are available for mining NFIS and most of them only use single minimum support, which implicitly assumes that all items in the database are of the same nature or of similar frequencies in the database. This is often not the case in real-life applications. Several methods are available for mining frequent itemsets with Multiple Minimum Supports (MMS), but these methods only mine Positive Frequent Item Sets (PFIS), doesn’t consider NFIS. So in this paper, we propose a new and efficient method, named emsNFIS, to mine NFIS with MMS. To the best our knowledge, e-msNFIS is the first method to mine NFIS with MMS and to deal with the problem of how to set up the minimum support to an itemset with negative item(s). E-msNFIS contains three steps: 1) using a classical algorithm MSapriori to mine PFIS with MMS; 2) using the method in e-NFIS to generate Negative Candidate Item Sets (NCIS) based on the PFIS got in step 1; and 3) calculating the support of these NCIS only by using the supports of PFIS and then getting NFIS. Experimental results on real datasets show that the e-msNFIS is very efficient.
منابع مشابه
Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to reflect the natures of the items, and an Apriori-based algorithm, named MSapriori, is developed to min...
متن کاملAn Enhanced Frequent Pattern Growth Based on Mapreduce for Mining Association Rules
In mining frequent itemsets, one of most important algorithm is FP-growth. FP-growth proposes an algorithm to compress information needed for mining frequent itemsets in FP-tree and recursively constructs FP-trees to find all frequent itemsets. In this paper, we propose the EFP-growth (enhanced FPgrowth) algorithm to achieve the quality of FP-growth. Our proposed method implemented the EFPGrowt...
متن کاملMining association rules with multiple minimum supports using maximum constraints
Data mining is the process of extracting desirable knowledge or interesting patterns from existing databases for specific purposes. Most of the previous approaches set a single minimum support threshold for all the items or itemsets. But in real applications, different items may have different criteria to judge its importance. The support requirements should then vary with different items. In t...
متن کاملE-fwarm: Enhanced Fuzzy-based Weighted Association Rule Mining Algorithm
In the Association Rule Mining (ARM) approach, equal weight is assigned to all itemsets in the dataset. Hence, it is not appropriate for all datasets. The weight should be assigned based on the significance of each itemset. The WARM reduces extra steps during the generation of rules. As, the Weighted ARM (WARM) uses the significance of each itemset, it is applied in the data mining. The Fuzzy-b...
متن کاملFast Algorithm for Mining Generalized Association Rules
In this paper, we present a new algorithm for mining generalized association rules. We develop the algorithm which scans database one time only and use Tidset to compute the support of generalized itemset faster. A tree structure called GIT-tree, an extension of IT-tree, is developed to store database for mining frequent itemsets from hierarchical database. Our algorithm is often faster than MM...
متن کامل